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Abstract 

We are given n base elements and a finite collection of subsets of them. The size of any 
subset varies between p to k (p < k). In addition, we assume that the input contains all 
possible subsets of size p. Our objective is to find a subcollection of minimum-cardinality 
which covers all the elements. This problem is known to be NP-hard. We provide two 
approximation algorithms for it, one for the generic case, and an improved one for the 
special case of (p, k) = (2, 4). 

The algorithm for the generic case is a greedy one, based on packing phases: at each 
phase we pick a collection of disjoint subsets covering i new elements, starting from i = k 
down to i = p + 1. At a final step we cover the remaining base elements by the subsets of 
size p. We derive the exact performance guarantee of this algorithm for all values of k and 
p, which is less than Hk, where Hk is the /c'th harmonic number. However, the algorithm 
exhibits the known improvement methods over the greedy one for the unweighted fc-set 
cover problem (in which subset sizes are only restricted not to exceed k), and hence it 
serves as a benchmark for our improved algorithm. 

The improved algorithm for the special case of (p, k) = (2, 4) is based on non-oblivious 
local search: it starts with a feasible cover, and then repeatedly tries to replace sets of 
size 3 and 4 so as to maximize an objective function which prefers big sets over small 
ones. For this case, our generic algorithm achieves an asymptotic approximation ratio 
of 1.5 + e, and the local search algorithm achieves a better ratio, which is bounded by 
1.458333... + e. 
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1 Introduction 

In the unweighted set cover problem, we are given n base elements and a finite collection of 
subsets of them. Our objective is to find a cover, i.e., a subcollection of subsets which covers 
all the elements, of minimum-cardinality. This problem has applications in diverse contexts 
such as efficient testing, statistical design of experiments, crew scheduling for airlines, and it 
also arises as a subproblem of many integer programming problems. For more information, 
see, e.g., [13], Chapter 3. 
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When we consider instances of unweighted set cover such that each subset has at most 
k elements, we obtain the unweighted k-set cover problem. This problem is known to be 
NP-complete [32], and it is MAX SNP-hard for all k > 3 [22j El [L8] . 

It is well known (see [5]) that a greedy algorithm is an //^-approximation algorithm for 
unweighted k-set cover, where = J2i=i J ls the fc'th harmonic number and that this bound 
is tight [TBI [20]. For unbounded values of k, Slavfk [25] showed that the approximation ratio 
of the greedy algorithm for unweighted set cover is Inn — In Inn + 6(1). Feige [8] proved 
that unless NP C DTIME(n polyl ° 9 n ), unweighted set cover cannot be approximated within 
a factor (1 — e) Inn for any e > 0. Raz and Safra [23] proved that if P ^ NP, then for some 
constant c, unweighted set cover cannot be approximated within a factor clogn. This result 
shows that the greedy algorithm is an asymptotically best possible approximation algorithm 
for this problem (unless NP C DTIME{pP o[ y l ° 9 n )). Goldschmidt, Hochbaum, and Yu [9] 
modified the greedy algorithm for unweighted k-set cover and showed that the resulting al- 
gorithm has a performance guarantee of Hk — g. Halldorsson [TU] presented an algorithm 
based on a local search that has an approximation ratio of H^ — | for unweighted k-set cover 
and a (1.4 + e)-approximation algorithm for unweighted 3-set cover. Duh and Fiirer [7] later 
improved this result and presented an (Hf. — ^-approximation algorithm for unweighted A;-set 
cover. Levin [19] improved their result and obtained an (Hk — 0.5026)-approximation algo- 
rithm for k > 4, and Athanassopoulos et al. [3] presented a further improved algorithm for 
k > 6 with approximation ratio approaching Hk — 0.5902 for large values of k. 

All of these improvements [9J [TOT [TJ [T9l [3] are essentially the greedy algorithm, with 
modifications on the way it handles small subsets. That is, they are all based on running the 
greedy algorithm until each new subset covers at most t new elements (the specific value of 
t depends on the exact algorithm), and then use a different method to cover the remaining 
base elements. 

In [14] . Hochbaum and Levin consider the problem of covering the edges of a bipartite 
graph G using a minimum number of K PtP bicliques (which need not be subgraphs of G). 
This problem arises in the context of optical networks design (see [2] ) , where p is typically 2 
or 3. In addition, it can be viewed as an instance of unweighted p 2 — set cover, where the base 
elements are G's edges, and the input collection consists of all K PtP graphs over G"s vertices. 
In that paper, they analyze the greedy algorithm applied for this special case, and show that 
it returns a solution whose cost is at most (H p 2 — H p + 1)0 PT + 1 (where OPT is the optimal 
cost). They also present an improved algorithm for the case p = 2 based on the property of 
the bipartite graph G, achieving an approximation ratio of 1.3 + e. 

If, in addition, the input collection contains some graphs that have up to k edges, k > p, 
then the resulting problem is an instance of the (p,k) -uniform unweighted set cover problem 
(see [H]), which we denote by (p, fe)-UUSC. That is, it is the variant of unweighted set cover 
where the size of every subset varies between p to k (jp < k), and the input contains all possible 
subsets of size p. In fact, their analysis of the greedy algorithm is for this generalization. Thus, 
the algorithms for unweighted k-set cover serve as a benchmark for our algorithms for this 
problem. 

Recall that the dual problem of unweighted k-set cover is the (maximum) unweighted k- 
set packing problem: We are given n base elements and a collection of subsets of them. Our 
objective is to find a packing, i.e., a subcollection of disjoint subsets, of maximum-cardinality. 
The fractional version of unweighted set packing is the dual linear program of the fractional 
version of unweighted set cover. The greedy algorithm for this problem, which returns any 
maximal subcollection of subsets, achieves an approximation ratio of \. Hurkens and Schrijver 
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|15] proved that for unweighted fc-set packing, a local search algorithm is a ^^-approximation 
algorithm. Athanassopoulos et al. [3] use this local search algorithm in each of their "packing 
phases" , and then use the method of Duh and Fiirer [7] in a final phase. 

The weighted fc-set cover problem and the weighted £;-set packing problem are defined 
analogously. However, this time each set has a cost (in the set cover variant) or a profit 
(in the set packing variant) and the goal is to minimize the total cost or to maximize the 
total profit, respectively. The greedy algorithm for the unweighted versions and the weighted 
versions have the same approximation guarantee (for each of the two problems). Hassin and 
Levin [12] improved the resulting approximation ratio for the weighted k-set cover problem 
for constant values of k, and Arkin and Hassin [2j improved the greedy algorithm for the 
weighted fc-set packing problem. 

The method of local search has been widely used in many hard combinatorial optimization 
problems. The idea is simple: start with an arbitrary (feasible) solution. At each step, search 
a (relatively small) neighborhood for an improved solution. If such a solution is found, replace 
the current solution with it. Repeat this procedure until the neighborhood (of the current 
solution) contains no improving solutions. At this point, return the current solution, which 
is locally optimal, and terminate. Observe that in order for this method to run in polynomial 
time, each local change should be computable in polynomial time, and the number of iterations 
should be polynomially bounded. 

Local search algorithms are mainly used in the framework of metaheuristics, such as 
simulated annealing, taboo search, genetic algorithms, etc. From a practical point of view, 
they are usually very efficient and achieve excellent results - the generated solutions are near 
optimal. However, from a theoretical point of view, there is usually no guarantee on the their 
worst-case performance. In the thorough survey Q], Angel reviews the main results on local 
search algorithms that have a worst-case performance guarantee. See also Halldorsson [11] 
for applications of this method to /c-dimensional matching, k-set packing, and some variants 
on independent set, vertex cover, set cover and graph coloring problems. 

In [18], Khanna et al. present the paradigm of non-oblivious local search. The idea, 
as they comment, has been implicitly used in some known algorithms such as interior-point 
methods. In that paper, they define the formal general algorithm in the context of MAX 
SNP. Then, they develop non-oblivious local search algorithms for MAX fc-SAT, and for the 
problem MAX /c-CSP which they define, which is a generalization of all the problems in 
MAX SNP. The idea in the context of set cover is as follows. Any standard (i.e., oblivious) 
local search algorithm must explicitly have the same objective: minimizing the number of 
picked sets. (Different such algorithms may look at different neighborhoods). However, a 
non-oblivious local search algorithm may have a different objective function to direct the 
search. 

Paper overview. In section [2j we present an algorithm for (p, fc)-UUSC (for any values 
of p, k). This algorithm is based on applying the best known approximation algorithm for 
set packing (described in [15]) in each of the packing phases. For (p, /c)-UUSC where p > 2, 
this algorithm exhibits all previously known methods to improve upon the greedy algorithm 
for unweighted k— set cover. Hence, this algorithm serves as a benchmark for our improved 
algorithm. For the special case of (p, k) = (2, 4) it achieves an asymptotic approximation 
ratio of 1.5 + e. In section [3l we present an improved algorithm for the case of (p, k) = (2, 4), 
which is based on non-oblivious local search, and we show that its (absolute) approximation 
ratio is at most §| + e = 1.458333... + e. In section HI we discuss some open questions. 
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2 A first approximation algorithm for (p, /c)-UUSC 



Our algorithm is described in Figure [TJ 



ALGORITHM Al 



Packing phases: 

for i = k downto p + 1 do: 

find a maximal collection of disjoint z-sets using 
a ^^-approximation algorithm. 



Phase p: 

cover the remaining base elements with disjoint p-sets. 



Figure 1: Algorithm Al. 



We analyze this algorithm using a factor revealing linear program. We assume that p > 1 , 
k > 2, k > p. We also assume that the input satisfies the subset closure property and, 
consequently, that the cover consists of disjoint subsets. Note that in explicit representation, 
this causes the input size to increase by a factor of 2 k — 1 at the most, since for each subset, 
all its non-trivial subsets are added to the collection. However, such explicit representation 
is not necessary for our algorithm, and we use it only for the analysis. Another simplifying 
assumption for the analysis is: 

Assumption 2.1 The input consists exclusively of the sets in APX and OPT. In addition, 
APX n OPT = 0. 

The justification of this assumption is fairly simple. Regarding its first part, observe that if 

the sets selected by Al in phase i cannot be improved, then this collection of i— sets cannot be 

improved by replacing some of them by subsets of OPT (or subsets of them). Hence, subsets 

outside APX U OPT can be removed. 

For the second part, observe that if there is a subset S in both APX and OPT, removing 

S and its elements from the input results in an instance for which APX \ {S} is a feasible 

solution and OPT \ {S} is an optimal solution. But the approximation ratio for this new 
■ , ■ / \APX\-1 ^ \APX\ 

instance is p = \ pt\-i — \opt\ = P- 



At any point in the execution of the algorithm, we define an i— set to be a subset of size 
i, such that all of its elements are uncovered. We define cti j to be the ratio of the number 
of j— sets in OPT in the beginning of packing phase i, to \OPT\, i = p + 1, k, j = 1, i, 
and for phase p we define a PtP to be the ratio of the number of uncovered elements in the 
beginning of phase p, to p\OPT\. 

Our analysis of Algorithm Al is similar to that of [3]. In each packing phase i (p + 1 < 
i < k) we find a collection of i— sets which is maximal. Therefore, in all of the next phases j 
ip < j < i) there are no i— sets available. Similarly, in phase p there are no i— sets available, 
i > p. Thus: 




i = p + 1, k 



(1) 



QpiP — 1 * 



(2) 
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Denote by Vi the remaining uncovered elements in the beginning of phase i, i = p, ...,k. 
By definition of Ojj, their number is \Vi\ = J2)=ij a i,j\OPT\. In packing phase i, we pick 
i— sets that cover the elements in Vi \ V%-\. Since V\-\ C Vi their number is: 

\V\V l - 1 \ = \V i \-\V i - 1 \= l^ja i:J -Y,jai-i,A\OPT\ , i = p + 1, k . (3) 

V=l 3=1 j 

At the beginning of packing phase i, there are at least a>ij\OPT\ available i— sets. Therefore, 
the approximation algorithm picks at least {^£)ai^\OPT\ i— sets, thus covering at least 
(2 — e)cii j i\OPT\ new elements. Hence, \V \ Vi-i\ > (2 — e)a,i t i\OPT\. Using ([3]) and omitting 
the e term, this yields: 

i-l i-l 

^ ioj-i^- - ^ jajj -(i- 2)ai,i < , i = p + 1, fe . (4) 

3=1 3=1 

Define ti to be the number of i— sets that are picked in packing phase i, i = p + 1, k. Then 
© yields: 

1 f lJ 1 \ 

^ = t|VJ\K-i|= \~z2j a i,3--z2j a i-ij ]\OPT\ , i = p+ l,...,k , (5) 

Z \S=1 % 3=1 ) 

and for phase p define t p as: 

<p = a PtP \OPT\ . (6) 

Note that [a PjP |OPT|] is the number of p— sets that are picked and possibly an additional 
set of size less than p, covering the remaining elements. Due to this last set, we obtain an 

E fc = u+i „ ™, 

asymptotic approximation ratio. Specifically, it is — \qpt \ — ■ Using dSJ) , (J6J) , we obtain: 



E}=pt< + 1 A iA. i£i . \ l 



3=1 i=p+l \ v ' 3=1 



Thus, maximizing the right-hand side of (|7|) subject to the constraints (HI),©, Q and 
a i,j > 0, yields an upper-bound on the approximation ratio of Algorithm Al. Observe that 
asymptotically, the term njpm is arbitrarily small. For convenience, since it is a constant in 
the objective function, we omit it. The resulting LP is: 

Program (_P) 

max | T,j=i 3 a k,j + EiLp+i (wry Sj=i J a i,j) + ^h" a p,p 
s.t. Ei=iOij<l »=p+l,...,A; (8) 

a p , p < 1 (9) 
Ej=i jai-ij- - £pi jay - (i - 2)a ¥ < i = p + 1, fc (10) 

dij > i =p, k, j = 1, . 

It is possible to derive a closed-form solution for this LP. 
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Theorem 2.1 The solution of program (P) is given by: 



• Case 1: k — p even: a p+2 j+i, P +2j = a p+2j , p+2j = 1 for all j = 0, k \ 2 , a k ,k = 1, 
and all other aij 's are zeros. 

• Case 2: k - p odd: a p+2 j+i, P +2j = a p+2jtP+2j = 1 for all j = 0, fc ~f~ 3 , a k:k = 
a k-i,k~2 = 1; an d all other 's are zeros. 

Al is an asymptotic (p + e)- approximation algorithm for (p,k)-UUSC, where p is (P)'s ob- 
jective function value, and is given by: 

Hk — Hp +1 p even, k even 

2 2 

H fc-i — He + 1 + \ — V even, k odd 

2(H k - H p+ i) - H% + H v +x + 1 + \ — k (k-i) P °dd> k even 

2(H k+ i — H„+i) — Hk+i + Hp+i +1 p odd, k odd . 

2 2 

The proof is technical, and can be found in the Appendix. This is an asymptotic approxima- 
tion ratio due to the rQpm term which we neglected. 

Corollary 2.1 Al is an asymptotic (1.5 + e)- approximation algorithm for (2,4)-UUSC. 



3 An improved algorithm for (2, 4)-UUSC 

In this section, we describe an improved algorithm for the case (p, k) = (2, 4). That is, subsets' 
sizes are between 2 to 4, and all possible 2— sets are available. Our algorithm is based on 
a non-oblivious local search. Specifically, denote by X 2 ^X^,X^, the number of 2,3, 4-sets in 
APX, respectively. Then the number of base elements is n = 2X 2 + 3X3 + 4X4 and the set 
cover objective is to minimize X 2 + X3 + X4. However, the objective of our algorithm is to 
maximize 4X^ + X3. This is equivalent to minimize X 2 + X3. Intuitively, the large sets are 
given higher priority because a cover which consists of many large sets is good (due to the 
disjointness assumption). Observe that this objective function is related to that of packing 
problems, which are the dual of covering problems. Our local search algorithm is described 
in Figure [21 It is parameterized by e, which we assume to be small enough, say e < and 
in addition, without loss of generality we assume that - is an integer. 

ALGORITHM A2 

1. Start with an arbitrary feasible cover. 

2. Perform a local search improvement step: 

remove up to - 3— and 4— sets, 

insert any number of 3— and 4— sets, so as to maximize 4A4 + X%. 

3. Goto step 2, until no local search improvement step exists. 

4. Cover the remaining base elements with 2— sets. 

Figure 2: Algorithm A2. 

APX, the cover returned by the algorithm, is a local optimum. The following observation is 
trivial: 
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Observation 3.1 Every feasible solution SOL is of size j < \SOL\ < ^. Consequently, 
\APX\ = @(\OPT\), \OPT\ = @(\APX\). 

Note that this observation implies that if \OPT\ < ^, then APX is also an optimal 
solution. We use the following definition for convenience: 

Definition 3.1 The i—sets in OPT are called i— columns; the i—sets in APX are called 
i— rows. We simply use columns and rows in places where their size is irrelevant or clear 
from the context. 



3.1 Restricting the input type 

In order to analyze the performance of Algorithm A2, we assume, as in the previous section, 
that the input collection satisfies the subset closure property, and that feasible solutions 
consist of disjoint subsets. We also continue to assume Assumption 12. 1\ i.e., that the input is 
APX U OPT, where APX n OPT = 0. The next assumption, which is less trivial, restricts 
the type of instance in the bad examples for the algorithm: 

Assumption 3.1 The instance belongs to one of the following two types: 

• Type A: OPT consists exclusively of 4— columns, APX consists of 2— ,3— and 4— rows, 

• Type B: OPT consists exclusively of 3— and 4— columns, APX consists exclusively of 
2— and 4— rows. 

In order to justify this assumption, we prove the following result: 

Lemma 3.1 Let I be a given instance. Let APX be a local optimum in I, let SOL be an 
arbitrary (feasible) solution in I with \SOL\ < \APX\, and let p = m^gi ■ Then there exists 
an instance I' having solutions denoted by SOL' and APX' , satisfying: (i) APX' is a local 
optimum in I' achieving the same approximation ratio, i.e., p' = rg^p | = p, (it) SOL' 
contains no 2— columns, (Hi) SOL' contains no 3— columns or APX' contains no 3— rows. 

Proof: Recall that APXnSOL = by assumption. We refer to SOL's sets as columns. Given 
/, we construct the new instance I' in two phases. In Phase 1 we eliminate the 2— columns 
in SOL (if any); in Phase 2 we try to eliminate the 3— columns in it. We begin by describing 
Phase 1. Denote by the number of 2— and 3— rows in APX, and by nf 0i the number 

of 2— columns in SOL. We may assume that ^{^3} > 1) otherwise both APX and SOL are 
optimal solution (consisting entirely of 4— sets). We show how to eliminate minln^f^, nf OL } 
2— columns from SOL. Thus, if nf OZ/ > ^{2^3} we ma y recursively apply this transformation 
to the resulting new instance, until (the new) SOL contains no 2— columns. In addition, the 
approximation ratio, p, remains the same. 

Let C be a collection of min{n J ^'^y , nf OZ/ } 2— columns in SOL (if nf OL < n^f^y then 
it is unique). Then for each 2— column c 6 C, there exists a distinct 2— or 3— row in APX 
which we denote by r c . Let /' be the instance in which each c G C is extended to a 3— column 
c' = cU {x c } and r c £ APX is extended to r' c = r c U {x c }, where Xq IS Si distinct new base 
element corresponding to c. These extended sets will be referred to as new. Sets from which 
new sets were obtained will be called source sets. 
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Construct from APX a feasible solution for I 1 by replacing each source row by the new 
row extending it. Denote the resulting collection by APX' . Similarly, construct SOL' from 
SOL by replacing each source column in SOL by the new column extending it. That is, SOL' 
contains new 3— columns obtained from source 2— columns in SOL ; APX' contains new 3— 
and 4— rows obtained from source 2— and 3— rows in APX. 

We show that APX' is a local optimum in Suppose to the contrary that this is not 
so. Then there exist a row collection T' C APX' , and a subset collection S' consisting of 
columns and (possibly, by the subset-closure assumption) of sub-rows of T' satisfying: (i) 
\T'\ < and (ii) replacing T' by S' improves the objective function value. More specifically, 
for j € {2,3,4}, denote by t'j and s'j the number of j— sets in T' and S' , respectively. Then 
by assumption: 

4< + 4 < 4s 4 + 4 . (11) 

Let T C APX consist of the source (2— and 3— )rows from which the new (3— and 4— )rows 
in T' were obtained, and of all the remaining non-new rows in T' . Similarly, let S consist 
of the source (2— )columns from which the new (3— )columns in S' were obtained, and of 
all the remaining non-new columns in S'. Let 7773,7774 be the number of new 3— ,4— rows 
in T', respectively (i.e., S has 7773 + 771,4 source (2— )columns which were extended to new 
(3—) columns in S'). Thus, 

4 = U + m 4 , t' 3 = t 3 + 7773 - 7774, s 4 = s 4 , s' 3 = s 3 + 7773 + 7774 . (12) 

Using (fTT|) and (fl~2|) . we obtain: 

4*4 + ts = 4(4 — 7774) + t 3 — 7773 + 7774 = 44 + 4 — m 3 ~~ 3 m 4 
< 4s 4 + s 3 — 777 3 — 3777,4 = 4s 4 + S3 — 2t77 4 < 4S4 + S3 , 

that is, 4*4 + *3 < 4s4 + S3. But this implies that the algorithm can replace T by S in / and 
improve the objective function. This is a contradiction to APX being a local optimum in /. 
Finally, since \APX'\ = \APX\ and \SOL'\ = \SOL\, it follows that p' = p. Thus, at the end 
of Phase 1, properties (i),(h) stated in the Lemma hold. 

We now proceed to describe Phase 2. The idea is similar to that of Phase 1, but with 
two differences: first, the new rows which are used to cover the new base elements in the new 
(4— )columns are only 4— rows (extending 3— rows in APX). (This is so because extending a 

2— row in APX to a 3— row may result in a non-local optimum); second, let n 3 px (n§ OL ) 
denote the number of 3— rows (columns) in APX (SOL). Then this time, as opposed to what 
we did in Phase 1, if n 3 px < nf OL , we cannot repeatedly perform the transformation on the 
new instance, since it is possible for a local optimum to contain no 3— rows. Thus, SOL' - 
the new solution constructed from SOL, is only guaranteed to have min{nf px ,n 3 OL } less 

3— columns than SOL. 

With a slight abuse of notation, we let / denote the instance resulted from Phase 1, with 
APX and SOL its corresponding solutions, and let I' denote the new instance which we 
construct in this phase, with APX' and SOL' its corresponding solutions. 

Let C be a collection of min{n 3 px ,n 3 OL } 3— columns in SOL. Thus, for each c £ C, 
there exists a distinct 3— row in APX, denoted r c . Define I' to be the instance in which each 
c G C is extended to the new 4— column c' = cU {x c } and r c £ APX is extended to the new 

4— row r' c = r c U {x c }, for a new distinct element x c . 

As was done in Phase 1, construct APX' (SOL') from APX (SOL) by replacing source 
sets by the new sets extending them. That is, SOL' contains new 4— columns extending 
source 3— columns in SOL ; APX' contains new 4— rows extending source 3— rows in APX. 
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We show that APX' is a local optimum in I'. If this is not the case, there exists T' C 
APX', with \T'\ < \ that can be replaced by a collection S' consisting of columns and subsets 
of rows, improving the objective function value. That is, using the notation t'- and s'j from 
before, the inequality (fTT|) holds. 

Let T C APX (S) consist of the source 3— sets in APX (SOL) from which the new sets 
in T' (S') were obtained from, and all the other non-new sets in T' (S 1 ). Let m be the number 
of new columns in S', which is equal to that of the new rows in T' . Thus, 

^4 = ^4 + t 3 = t 3 — in, 54 = 54 + m, s' 3 = s 3 — in . (13) 

Using (fTTj) and (fl"3j) . we obtain: 

4t 4 + t 3 = 4(4 - m) + 4 + m = At' A + t' 3 -3m < As' 4 + s' 3 - 3m = 4s 4 + s 3 , 

that is, 4t4 + 1 3 < 4^4 + 53 - contradicting the fact the APX is a local optimum in I. Finally, 
we have \APX'\ = \APX\, \SOL'\ = \SOL\, implying that p' = p. 

At the end of Phase 2, the constructed instance i 7 with its corresponding solutions APX' 
and SOL' satisfy properties (i),(ii),(iii). ■ 

Note that 4X4 + X 3 , the objective function of Algorithm A2, does not take into account 
the number of 2— rows (as the algorithm only uses them to cover the remaining elements 
that were failed to be covered by 3— or 4— rows). This observation motivates the following 
terminology, which we make solely for convenience: We will refer to the base elements which 
are covered by 2— rows as uncovered. 

Once again, we use a factor revealing LP to bound the approximation ratio of the algo- 
rithm. That is, our goal is to formulate an LP whose objective function value is an upper 
bound on the worst case approximation ratio of A2 (denoted by p). We treat each of the two 
instance types separately. 

3.2 Bounding p in Type A-instances 

In this subsection we assume that the instance is of Type A, that is, OPT consists exclusively 
of 4— columns, while there is no restriction on APX. We use the following notation: 

Definition 3.2 For given OPT and APX , let Oi.j be the set of columns in which i elements 
are covered by 4— rows and j elements are covered by 3— rows, < i + j < 4, and let X^j = 
j^py| be the proportion of Oij -columns in OPT. 

Observe that all Ajj's are non-negative and that they sum up to 1. We would like to express 
the objective function of set cover in terms of these new variables. We do so using a simple 
pricing method: as each row of APX costs 1 and as the rows are disjoint, an element covered 
by an i— row costs i, i = 2,3,4. Thus, an Ojj-column costs 

^j = \i + \j + \^-i-j) , 0<i+j<4. (14) 

Therefore: 

\APX\ =X 2 + X 3 + X A = J2 c ij\°iJ I = E CijXijlOPT} . 

0<«+j<4 0<«+i<4 
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Dividing by \OPT\ gives the approximation ratio of the given instance, which is Y^ij c i,j-^i,j- 
Thus, p = max j J2 i j CijXij (the maximum taken over all legal instances), so our LP's 
objective is: 

max ^2 c i,j x i,j ■ (15) 

0<«+j<4 

In order to bound this function, we derive additional linear constraints. Our goal is to bound 
the Xjj's with the highest Cj j coefficients. In light of our pricing scheme, this is interpreted 
as not buying too many expensive columns. Starting by considering the most expensive ones, 
the following constraints are easy to establish: 

Lemma 3.2 For any Type A-instance, Oo,0; Oo,i> Oo,2; (3o 5 3, Oi ; o = 0. Equivalently, 
-^0,0)^0,1)^0,2,^0,3)^1,0 = 0. 

Proof: Consider Oq^, i = 0, ...,3. If, by contradiction, Oo,i 7^ $ for some i, then there exists 
a column S with i of its elements covered by 3— rows, and the other elements are uncovered. 
Removing these 3— rows from APX and inserting S would increase ^42's objective function. 
Thus, Oo,« = 0- If Oi,o 7^ O then there exists a column S having one element covered by a 
4— row, which we denote by R, and the other elements are uncovered. Removing R from APX, 
inserting S and the 3— row subset of R: R \ (R D S) (recall the subset closure assumption), 
would again, increase ^42's objective function. In either case we obtained a contradiction to 
APX being a local optimum. ■ 

Among the remaining variables, the two X{ .j's which have the largest coefficients in the 
objective function of the LP are, according to (|14p . X\^\, with ci t i = j|, and X2.0, with 
C2,o = § • We would like to obtain an upper-bound on them, using a linear inequality. For 
this purpose, we use an intersection graph. 

The intersection graph G 

With a little abuse of terminology we will refer to APX, OPT, and to subsets of them, as 
both the sets of indices representing the subsets of base elements, and the sets of vertices 
representing them in the following graph. 

For a given instance, let G = (V, E) be a bipartite graph, in which one partite is the set of 
all 3— and 4— row members of APX, and the second partite is OPT. For u a 3— or 4— row in 
APX and v £ OPT there are / (parallel) edges connecting u and v if the intersection of (the 
subsets represented by) u and v consists of I base elements. Thus, for v € Oij, degc{v) = i+j- 
G is the intersection graph corresponding to APX and OPT, or, the intersection graph of the 
given instance, where APX is a local optimum and OPT is an optimal solution of that 
instance. 

Let G be an intersection graph of a given instance, and let F = (V(F), E(F)) be any 
induced subgraph of G. Denote by Of; the columns in F which are in Oij, and denote by 
nf and nf the number of rows and columns in F, respectively (i.e., nf = \V(F) n APX\, 
nf = \V(F) n OPT\). Also let n c = , n r = nf. Note that n c = \OPT\, and that 
n r < \APX\ (due to the uncovered elements, i.e., those covered by 2— rows). Finally, F is 
called small if nf <\ — 2, otherwise it is called big. (The reason for defining small subgraphs 
as those of size at most 7—2 rather than - will be clear in the sequel). 

Throughout the rest of the paper, we use 'CC as an abbreviation for 'connected compo- 
nent'. We analyze the performance of Algorithm A2 by considering G's CC's. Recall that 
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when we stated the algorithm, we observed that it is optimal for instances in which an optimal 
solution consists of ^- sets at the most. In terms of G, this is generalized to small CC's: 

Lemma 3.3 Let G be an intersection graph of a given instance, and let F be a small CC of 

G. Then the base elements covered by F's columns are covered optimally by Algorithm A2, 
and |O£ l = n c ' > implying that \OfA = for all ^ (4,0). 

Proof: The algorithm, which has no access to G, performs local improvement steps on col- 
lections of 3— and 4— rows of size at most -. Thus, it can remove all the < - rows of F 
and replace them with F's columns, which optimally cover the base elements in this CC. The 
rest of the claim follows from the fact that the instance is of Type A. ■ 

Our goal is to upper-bound A2's approximation ratio. Since the following analysis can be 
performed componentwise on each of G's CC's, Lemma 13.31 implies that small CC's in G can 
only improve the algorithm's performance, decreasing its approximation ratio. Thus, we may 
assume, without loss of generality: 

Assumption 3.2 The intersection graph G is connected and big. 

We now turn to deal with A^o and X\\. We derive a linear inequality in the Xij vari- 
ables which will be an additional constraint in the LP that we construct. It is derived using 
a special graph, which we construct in two stages. 

The H subgraph 

We define the following subgraph of G, which we refer to as the H subgraph: it is the subgraph 
of G induced by the set of On and 02,0 columns and the set of 4— rows which intersect at 
least one 0% t \ or O2,o column. See an example in Figure El 

Observe that H need not be connected (as opposed to G, by Assumption I3.2|) . Also 
observe that since the only rows in H are 4— rows, each Oi,i vertex has a single neighbor in 
H (i.e., the 4— row intersecting it). We record this fact for future reference: 

Lemma 3.4 Each 0\ t \ vertex is a leaf in H. 

For any subgraph F of H, let A(F) denote the maximum degree of a vertex in F. In addition, 
for A C {0, 4}, let denote the set of row vertices in F of degree i in F for some i £ A. 

We start by investigating the number of 0\ t i vertices in H. The following result implies 
that there cannot be too many of them: 

Lemma 3.5 Let c,d be two distinct column vertices in 0\ t i which belong to the same CC of 

H. Then every c — d path P in H has > - — 1 row vertices. 

Proof: We may assume that c and d are connected by a (simple) path P of minimum length 
among the paths in H connecting a pair of 0\ } \ vertices. Let P = (c, r±,ci, ...,rj_i, 77, d). 
By Lemma 13.41 the 0\ \ vertices are leaves in H. Therefore, the vertices ci,...,q_i are O"2,o 
columns, and n, ...,7*j are 4— rows. Denote by r and s the 3— row neighbors in G of c and d, 
respectively, and define P' = (r, c, r%, ci, ...,rj_i, c/_i, r\, d, s). Observe that P' cannot be a 
cycle: if r = s, then removing (the 3— row) r from APX, and inserting the two 3— column 
subsets c\ri (i.e., c's two uncovered base elements and the singleton cHr) and d\r[ increases 
the objective function by 1, which is a contradiction to APX being a local optimum. Thus, 
P' is path from r to s. If < K the algorithm can replace the rows r, r%, 77, s with 
the columns c, c±, ...,q_i,d, again increasing its objective function, which is a contradiction. 
Thus, > - + 1, implying = — 2 > - — 1. ■ 
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(a) An instance 



(b) The corresponding H 



Figure 3: An example of an instance and the corresponding H subgraph. In (a), the given 
instance is shown, where only the 4— rows intersecting O 2i o U Oi,i are included. A '-' stands 
for an uncovered base element, and a '*' stands for an element covered by a 3— row. Thus, 
01,03 G Oi 5 i, o 2 , 04, 05, o 6 G O 2) o, and qi,...,q 9 £ O^i U O 2 ,o- Also observe that ri,r 5 <E i?r^i, 
r 3 ,r 4 ,r 6 G and r 2 G 



Corollary 3.1 Every small CC of H has at most one Oi t \ vertex. 
As for big CCs, we have: 

Lemma 3.6 Let F be a big CC of H . Then \0[ tl \ = 0(e)n? . 

Proof: Assume that \Ofi\ > 1, otherwise the claim is trivial. Construct a Voronoi diagram 
on the set of F's vertices, with centers being its Oi t \ columns. By Lemma 13.51 an y path 
connecting two distinct such centers has at least - — 1 row vertices. Therefore, each Voronoi cell 



contains at least [^(7 — 1)J vertices. Thus, > | Cf^i I ( 1 



1)J, implying \0[A 



0(eK 



Thus, the Oi t i vertices are "negligible" in H, both in small and big components. We 
proceed to investigate the number of O 2j o vertices. We specify two useful properties of H: 
the first states that small CCs are either double edges (i.e., two parallel edges between a pair 
of vertices), cycles, or trees, and the second is a characterization of a local optimum. 

Lemma 3.7 Every small CC of H is either a double edge, a cycle, or a tree. 

Proof: We prove the claim by showing that a small CC of H cannot include a double edge 
or a cycle as a proper subset. Thus, any small CC which is not a double edge or a cycle must 
be a tree. 

We start by showing that two vertices that are connected by a double edge have no other 
neighbors in H, implying that a CC of H cannot include a double edge as a proper subset. 
Suppose that a column c and a row r are connected by a double edge. Since, by Lemma 
13.41 the Oi 5 i vertices are leaves, it follows that c G O 2) o- Thus, |r Pi c\ = 2 (i.e., r covers 
two base elements of c), and c has no neighbors other than r. So suppose to the contrary 
that r has an additional neighbor d ^ c. If r covers two elements of d, then replacing r with 
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c, d produces a better solution, which is a contraction. Otherwise, |r H d\ = 1 and d has an 
additional neighbor, which we denote by s. Then removing r and inserting c and d \ (d fl s) 
(i.e., the 3-row subset of d consisting of cfs two uncovered elements and the singleton r f~)d) 
again produces a better solution, which is a contradiction. 

In order to complete the proof, we show that a small cycle has no neighbors outside it, 
again, implying that a CC of H cannot include it as a proper subset. Let C be a small cycle 
in H. We show that for each vertex in C, its neighbors in H are precisely its two neighbors in 
C. Again, since 0\ \ vertices are leaves (by Lemma l3.4p . it follows that C's vertices alternate 
between rows and O2,o-columns. By definition, each O2,o column has exactly two (4— )row 
neighbors, hence, they are in C. As for the rows of C, suppose to the contrary that there 
exists a (4— )row vertex r G C that has a neighbor c £ H \ C . First observe that r cannot be 
connected to c by a double edge since in that case, as we just proved, that double edge is by 
itself a CC, which is a contradiction. Thus r covers a single base element of c. Let c' be the 
3— column subset of c consisting of c's two uncovered base elements and (the singleton) r n c. 
As < -, the following local step can be applied: remove C's rows from the current solution 
and insert C's columns and c'. The number of 4— sets in the new solution is the same, while 
the number of 3— sets increases by one. Thus, this step is a local improvement one, which is 
a contradiction. ■ 



Lemma 3.8 Let T be a small subtree of H . (i) If all the leaves in T are (4—)row vertices, 
then their number, iRj^l, is at most 4. (ii)IfT has exactly \Rji^\ = 4 leaves, then T contains 
no Oi^i vertices. 

Proof: Assume A(T) > 2, otherwise the claim is trivial (note that for part (ii), if A(T) < 2 
then T cannot have 4 leaves). Hence ^ 0- 

(i) Since T's vertices alternate between rows and columns, it follows that 

n T r = n T c + 1 . (16) 

To see this, partition T into edge-disjoint paths by the following iterative procedure: start 
with any path P connecting two arbitrary leaves, and mark its vertices. Clearly, = + 1. 
As long as there exist unmarked vertices, choose a minimal (with respect to inclusion) path 
Q connecting an unmarked leaf to a marked vertex u. Note that u G 4 |, i.e., = + 1, 
and since Q is minimal, all of Q's vertices except for u are unmarked. Marking Q's vertices, 
the number of row vertices which are marked for the first time is equal to the number of such 
column vertices. Summing over all paths, we obtain = + 1. 

Observe that for each row leaf r € r ' s neighbor is a column in 02$, since the 0\ : i 

are leaves (by Lemma [3. 4p and A(T) > 2 by assumption. As nj < -, the following local step 
can be applied: 

• remove the nj rows of T from the current solution, 

• insert the n^T columns of T, 

• for each (4— row) leaf r G insert its 3— row subset consisting of the three elements 
which are not covered by r's (O2,o) neighbor in T . 
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Thus, we traded one 4— row for |i?/n| 3— sets. Due to our objective function, we must have 

L {l}l 



< 4, otherwise this step would be a local improvement one, which is a contradiction. 



(ii) Suppose to the contrary that there exists a subtree T C H with = 4 row leaves 

such that Of 1 ^ 0. Denote these row leaves by r\, r&, and let c±, C4 be their correspond- 
ing neighbors. Observe that c\,...,C4 ^ Oi,i, since if q € O^i for some i, then by Lemma 
13.41 it is a leaf, implying that (Vj,Cj) is an isolated edge, contradicting the assumption that 
T is a tree with four leaves. By Corollary 13-H there is exactly one Of 1 vertex, which we 
denote by c. Let r be c's 3— row neighbor, and let T' = (V(T) U {r},E(T) U {(c, r)}). Thus, 
all the leaves in T' are row vertices. It then follows, by exactly the same argument in part 
(i), that nj = +1. Therefore, in T we have: nj = n^. As nj < -, we can remove 
the nj 4— rows and (the 3— row) r, and insert the columns and the four 3— row subsets of 
the leaves: r, \ Cj, % = 1, ...,4. The number of 4— sets remain the same, while the number of 
3— sets increases by 3, which is a contradiction. ■ 

We emphasize that T need not be a CC of H. It may be a proper subset of a CC. If T is a 
CC, the following result holds: 

Corollary 3.2 Let T be a small CC of H which is a tree. Then \Rj l} \ < A, \RT 3A y\ < 2. 
Consequently, \Rj 2 \\ > nj — 6. 

Proof: The leaves of T are either Rjiy or Oi t i vertices. If all of them are RTn vertices, then 

by Lemma 13.81 (i): I^Tnl < 4. Otherwise, Corollary 13.11 implies that T contains exactly one 

O11 vertex. Deleting it from T, we obtain a subtree T' whose all leaves are the Rjiy vertices. 

By Lemma [32] (i): \RT l} \ = \R^ } \ < 4. 

For the second part, recall from Graph Theory that the number of leaves in a nontrivial 
connected graph G with ni vertices of degree i, i = 1, A(G), is bounded by: 

A(G) 

ni < 2 + J2 (« " 2 >i ■ ( 17 ) 

i=3 

(This follows from J^H? in i = 2 \ E (G)\ > 2(\V(G)\ - 1) = 2(J2t=f ] n A - 1)). If G is a tree, 
then (|17p holds as an equality, which we apply to T and obtain: 

\ R \i} 1 = 2 + |-R{3} I + 2|i?|4} I . (18) 

We then conclude that: 

|-ft{3,4}l = I -^{3} I + \ R {4,}\ ^ \ R {3} \ + 2 I- R {4}I = \ R {1} \ - 2 < 2 . 



(The last inequality follows from the first part). Thus 4 | 



< 2. 



Corollary 13.21 implies that most of the rows in a small tree T have degree 2, i.e., they 
are the RJ2} vertices. This is intuitive, as we can view these rows as "links" connecting two 
columns in a "chain", while very few rows are "end-rows" (namely, the Rjiy ones), and even 
fewer rows are "links" to other "chains" (the ^{34} ones). Observe that for F a cycle or a 
double edge, it is trivial that all the rows have degree 2, i.e. \R^y \ = \R{3 4}! =0, |-R^ 2 }I = n r- 
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For the big CCs of H, the dominance of the R{2} rows still holds, but in a weaker sense. 
In order to establish it, we look at small neighborhoods around the vertices of a big CC F, 
bound the number of vertices of degrees 3 or 4, and by summation obtain a bound on \Rf 3 4 j |. 
A bound on |-R/n| then follows naturally. 

Definition 3.3 For e > and v G H, let B e {v) be the neighborhood of radius ^ centered at 
v in H, i.e., the set of all vertices u in H such there exists a u — v path in H of length at 
most jt. 

Observe that may be greater than -. In addition, it is possible for a "boundary" 

vertex u G B e {v) that deg Be ^(u) < degu{u) } i.e., if its distance from v is exactly ^. 

Lemma 3.9 For any v G H , B e (v) contains at most two vertices of degree 3 or 4 in H , i.e., 

\B e (v)nRf m \<2. 

Proof: Suppose to the contrary that there exists v G H such that B e (v) contains at least 3 
vertices in R 1 ^ 4 |. Pick any three of these vertices and denote them by v i, t>2, ^3- Let B' be a 
spanning tree of B t (v), and let Pi be the v — V{ path in B' , i = 1,2, 3. Let T be the subtree of 
B' defined by T = ljf=i Pi- We first show how to augment T to obtain a subtree T 1 C B £ (v) 
with at least 5 leaves which are column vertices: 

• Case 1: T has at least two leaves in {vi,v%, i^}, say v\ and V2- Then each of v\,V2 has 
at least two neighbors which are not in T, and in addition, either v is a leaf or v% has 
at least one neighbor which is not in T. These neighbors are distinct and are different 
from v, otherwise H contains a small cycle as a proper subset, contradicting Lemma 
13.71 Let T' be the tree obtained by adding the edges connecting these neighbors to T. 
Then T' has at least 5 leaves, which are columns. 

• Case 2: T is a simple path from v to (say) v\. then v\ has at least two neighbors which 
are not in T, and each of ^2)^3 has at least one neighbor which is not in T. These 
neighbors are distinct by an argument similar to that in Case 1. Let T' be the tree 
obtained by adding the edges connecting these neighbors to T. Then again, T 1 has at 
least 5 column leaves (v being one of them). 

In both cases, we obtained a tree T' of size at most ^ + 5, which we clearly may assume to 
be less than i, with at least 5 leaves which are column vertices. Now, since T' is small, it 
follows by Corollary 13.11 that among these 5 column leaves, at most one is an Ox,i column 
vertex. Thus, at least 4 leaves are O2,o columns. Each such 02,0 leaf has an additional row 
neighbor outside T' . Again, these neighbors are distinct, otherwise there is a contradiction 
to Lemma 13.71 Adding the edges connecting these row neighbors to T', we obtain a tree of 
size at most ^ + 10 < -. It either has 5 or more row leaves, or exactly 4 row leaves and one 
0\ : \ leaf. In both cases we obtain a contradiction to Lemma 13.81 ■ 

We are now ready to upper-bound the number of vertices of degree 1, 3, and 4 in the big CCs 
of H. In particular, this establishes the dominance (in terms of a lower bound) of rows of 
degree 2 which we previously stated. Since, as we mentioned, we look at each CC separately, 
all bounds are in terms of the total number of rows in the specific CC. 



16 



Lemma 3.10 Let F be a big CC of H. (i) \R^^\ = 0(e)n^ , (ii) \Rf-a\ = 0(e)nf ', (Hi) 
|i?f 2} | > (1 - 0(e))<. 

Proof: For part (i), observe that: 

1^3,4}!= £ 1< £ 5e[S 6 (t;)|, 

where the inequality follows from the fact that for v in a big CC, |-B e (u)| > ^. 

Now, consider the multiset of vertices which belong to the (possibly overlapping) neigh- 
borhoods around all of R^ 3i j vertices, that is, we look at S = U ve RF Be{v) where we allow 

repetitions of elements in S. Every vertex appears at most twice in S. To see this, suppose 
to the contrary that there is a vertex u which appears at least three times in S. Then any 
three centers of neighborhoods which cover u are three R^^ vertices in B e (u). F is a CC 
of H, therefore Rf 3 4 | C R? 34 \, implying \B e (u) fl -R{^ 4 }| > \B e (u) H -R^ 34 }| > 3, which is a 
contradiction to Lemma [3791 Hence, J2 v &Ft F \Be(v)\ = \S\ < 2np. Combining this with the 
previous inequality, we obtain: 

\Rf 3A} \ < Wen F . (19) 

We would like to obtain the bound in terms of n^, the number of rows in F. Observe that 

each column intersects at most 4 rows. Thus, < 4ra^, implying that np = + < 5n,f . 
Substituting this in (fl"9|) . we obtain: 

\R{ 3 ,4}\ < 50enf . (20) 

This proves part (i). 

For part (ii), applying (fT7|) to F, we obtain: 

\Rf 1} \ < 2 + \Rf 3} \ + 2\Rf 4} \ < 2 + 2[i?f 3)4} | < 2 + lOOeraf < 102enf , 

where the third inequality follows from (|20p . and the last one from the assumption that F is 
big. This proves part (ii). 

Part (iii) follows from parts (i) and (ii) (as = \R?iy \ + \R?2}\ + \ R {3 4}D- This completes 
the proof. ■ 

Now consider the Of}) columns for some big CC F of FL. We show that their number is 
about the same as that of R{ 2 } vertices. Intuitively, this is true since, as we proved, most of 
F's columns are in Ofg, most of its rows are in -Rj^j, and in every path the vertices alternate 
between rows and columns. Formally: 



Lemma 3.11 For a big CC F of H: \Rf 2} \ - 0{e)n^ < |O£ | < \Rf 2} \ + 0(e 



Proof: Let F be a big CC of H. We bound 5TJi=i ^l-^ml fr° m below and from above to 



L {2}i ^v c ;"r — 1^2,01 — r L {2} 

Let F be a big CC of H. We 

obtain: 

2|i2f 2} l < E^«l ^ \ R {i}\ + 2 \ R {2}\ + 4 l jR f3,4}l < 2\R{2}\ + 0(eK , 

i=l 



n, 



17 



where the last inequality follows from Lemma l3.10l fi) . (ii) . Counting F's edges using each of 
its two partite sets, we obtain: 

J2i\R^ } \ = IC^I + 2|O£ | = 2|O 2 F | + 0(e)n? , 
i=i 

where the last equality is by Lemma 13.61 Thus: 

2\R( 2} \ < 2|O£ | + 0(e)nf < 2\Rf 2} \ + 0(e)nf . 

Subtracting 0(e)n^ from all sides and dividing by 2 yields the claim. 
■ 

We note that for small CC's, the last result holds in a stronger sense: 

Remark 3.1 Let F be a CCofH. (i) If F is a small tree then \R¥ 2} \-1 < |O£ | < |i?£, } | + 5. 
(ii) If F is a small cycle or a double edge then \O 20 \ = |-R|^|. 

Proof: (i) Let F be a small CC of H which is a tree. First suppose that F contains no 
Oi t ± vertices, i.e., all of its leaves the Rfn vertices. Then equality (fT6j) holds for F, i.e., 
= + 1 (this is true by the argument used in the proof of Lemma [3.81 (i)). Thus, 
lOfol = n c = n r ~ 1- We now bound |Ofol f rom above and from below: 

|O 2 F | = nf-l<|i?f 2} | + 5, 

where the inequality follows from Corollary 13.21 and trivially: 

|O 2 F | = n r F -l>|i?f 2} |-l. 

Thus, — 1 < \0 2 fi\ — \R{2} \ + ^> as required. 

If Of 1 7^ then by Corollary 13. 1\ F contains exactly one 0\ \ vertex. We delete it from F 
to obtain a tree, denoted F', with all its leaves being R^y vertices. Thus, the last result holds 

for F', i.e., — 1 < \O 2 ' \ < \Rf 2 \\ + 5. By observing that O 20 = O 2 ' and Rf-n = -^{2}' 

we establish the result for F as well. 

(ii) This is trivial. ■ 

Recall that our goal is to bound and X 2) o - the proportions of On and O 2) o in G. So 
far we obtained a good estimation of their proportions in H: Corollary 13.11 and Lemma 13.61 
imply that the Oi t \ vertices are negligible in small and big CCs of H, respectively ; Lemma 
I3.11l and Remark 13.11 imply that intuitively, the proportion of O 2j o in H is about one half (the 
other half consists mainly of rows of degree 2). However, in order to bound the proportions 
in G, we need to take into account the columns which are not in 0\ t \ or O 2i o but intersect 
some row in that CC. This motivates the following construction: 

The H graph 

Denote those columns which intersect some row in H and are not in On U O 2; o by O. We 
construct the H graph, which need not be a subgraph of G, in two steps. First, let H be the 
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Figure 4: The H graph corresponding to the instance given in Figure [3] prior to the addition 
of the Hq subgraph. 



graph obtained from H by connecting each row in H to distinct new vertices representing 
the O-columns intersecting it. We emphasize that an O-column may appear in a few CCs 
of H: Suppose o € O and C\,...,Ci, 2 < I < 4 are CCs of H such that each one contains 
a row intersecting o. Then each corresponding CC in H will have its own (distinct) vertex 
representing o. Thus, in terms of vertex labels (where each vertex has a label of the set 
represented by it), the CCs of H are not O-column-disjoint. But they are 0\,\ U 02,0-column- 
disjoint as well as row-disjoint, and hence in particular are edge-disjoint. Figure d] shows the 
H graph corresponding to the instance given in Figure [3] up to this step. 

At a final step in the construction, we add to H the subgraph of G induced by the set 
of remaining vertices (if any), that is, all vertices which do not belong to any CC from the 
previous step. Denote this subgraph by Ho. Note that H$ need not be connected. Let C 
denote the collection of H 's CCs. The disjointness of the rows implies: 

n r = nf = n r ■ ( 21 ) 

FeCUH 

We use the following notation. Let F £ C U Hq. Denote by O f the O-columns in F. Denote 
by E20 the set of edges incident to the Ofo columns, by E[ 1 the set of edges incident to 
the Ofi columns, and by E F the set of remaining edges, i.e. E F = E{F) \ (E F U E F t ). 
Note that for F £ C (F ^ Hq), all the edges in E F are incident to columns in O f . Define 
-^2,0 = UfgC E F t Q, Ei t x = Ufgc an d E = Uf<=c E F ■ Finally, denote by rf the number of 
rows in V{F) n Rfiy The following observations are trivial (the first two hold in H as well): 

Lemma 3.12 (i) For each F G C U H , \E[ A \ = \Of A \ and \E% \ = 2\O F \, (ii) For each 
F € C, each vertex o G Of 1 is a leaf in F , (in) For each F G C, a row in F which belongs to 
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contributes i edges to E F ^ U E F and 4 — i edges to E F , i = 1, ...,4, that is: 

I^Col + \ E [l I = »f + 2rf + 3r F + 4rf , (22) 
= 3rf + 2rf + rf . (23) 

Lemma 3.13 For any Type A-instance, 

\E\ < |0 li2 | + |0 1)3 | + 2|0 2il | + 2|0 2j2 | + 3|0 3 ,o| + 3|0 3 ,a| + 4|O 4 , | • (24) 

Proof: Consider a vertex o € Oij-, i > 1> (^J) ^ {(1, 1), (2, 0)}. o € O if there exist an 
O 2 ,o-column q and a 4— row r such that r n g 7^ and rHo / 0. In this case, o contributes 
at most i edges (possibly in different CCs) to E. (Otherwise it contributes zero). ■ 

We now derive a linear inequality, which provides an upper-bound on the number of edges 
in £2,0 and -Ea,i. 

Lemma 3.14 For any Type A-instance, |-E 2 ,o| + 3|.E7i,i| < \E\ + 0(e)n r . 
Proof: We show that for every F 6 C U Hq, we have: 

|£ 2 F ol + 3|£i F il < + 0(e)nf . (25) 

By definitions of £2,0) ^11 an< i £, and using (|2ip . the claim then follows by summing over 
all F G C U Hq. We distinguish four cases, according to the type of F. 

• Case 1: F = H 

Since Hq contains no On and O 2j o vertices, it follows that Ej_° = E^q = 0- Hence (|25p 
trivially holds. 

• Case 2: F is a CC of H obtained from a double edge or a small cycle in H 

Denote by C the double edge or the cycle in H from which F is obtained. Then C 
is a CC of H, and its rows are precisely the rows of F (because H was obtained from 
H by adding columns). C has even length, with its vertices alternating between O 2j o 
columns and R 1 ^} (4— )rows. Thus, each such row has two O 2 ,o column neighbors (in C 

and therefore in F) and two O column neighbors (in F\C). Therefore, it contributes 
two edges to E^q and two to E F , i.e.: 

\El \ = \E F \ = 2n F . (26) 

Finally, observe that O f 1 = 0: this is true because as we just noted, C's columns are 
only in O 2) o, and F was obtained from C by adding O columns (which, by definition, 
are not in On). Thus, |Of\| = 0, implying that = 0. This fact and (|26p establish 

• Case 3: F is a CC of H obtained from a small tree in H 

The rows of F are precisely the rows of the tree in H which F is obtained from. Thus, 
subtracting (j22|) from (f23|) . we obtain: 

\E F \ ~ \E F \ - 1^1 = 2(rf - r F - 2r F ) = 4 , 
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where the last equality follows from rf = 2 + rf + 2r,f , which holds due to (|18p . This 
implies: 

l^ 2 F ol<l^ F |-4. (27) 

Now, by Corollary 13.11 F can have at most one On vertex. By Lemma 13.121 (ii), such 
a vertex is a leaf in F, implying that \E F 1 | = |Of\| < 1. Combining this with (I27p . we 
obtain: 

l<ol + 3|<i| < |<ol + 3 < \E F \ - 1 , 

establishing ([25]) . 

• Case 4: F is a CC of H obtained from a big CC of H 

We have: 

\E F \ > 2r F > 2\O F \ - 0(e)n F = \E F \ - 0{e)n F , 

where the first inequality follows from (|23p . the second inequality follows from Lemma 
13.111 and the equality follows from Lemma 13.121 (i) . In order to complete the proof, it 
suffices to show that \E F X \ = 0(e)n F . Denote by F' the (big) CC of H from which F 
is obtained. By Lemma I3T61 we have \0 F 'i\ = 0(e)n F ' . Since Of^ = Of x and similarly, 
the rows of F 1 are precisely the rows of F, we also have: |Of\| = 0(e)n F . By Lemma 
KT2\ (i): \E[ A \ = \Of A \. Hence \E[ A \ = 0(e)n F , as required. 

■ 

We are now ready to bound a linear combination of and Xi i: 
Lemma 3.15 For any Type A-instance: 

2X 2fi + 3X1,! < X 1>2 + X lj3 + 2X 2<1 + 2X 2 , 2 + 3X 3fi + 3X 3 ,i + 4X 4 , + O(e) . (28) 

Proof: From Lemma 13.141 we have: |i?2,o| + 3|£'i i i| < \E\ + 0(e)n r . From Observation 13.11 
we obtain n r = G(n c ), so we also have: |i?2,o| + 3[-Ei,i| < \E\ + 0(e)n c . We would like to 
write this inequality in terms of the column sets. By summation, Lemma l3.12( i) implies that 
1-^1,1 1 = \Oi,i\ and |-E2,o I = 2|02,o I ■ Thus, we obtain: 

2|O 2)0 | + 3|O 1 ,i| < \E\ + 0(e)n c . (29) 

Using Lemma l3.13( we obtain: 

2|Oa )0 | + 3lOi.il < |Oi, a | + |Oi, 3 | + 2\0 2}1 \ + 2|0 2)2 | + 3|0 3 ,o| + 3|0 3 ,i] + 4|0 4 ,o| + 0(e)n c . 

Dividing both sides by n c = \OPT\, we obtain the required inequality. ■ 

By providing the last constraint, Lemma [3.151 concludes our construction of the LP, which 
upper-bounds p - the approximation ratio of the algorithm (for Type A- instances). Recall 
that the other constraints are that the variables are non-negative and that their sum is 1. 
In addition, the variables Xo,o, Xo ; i, Xo,2, Xo j3 , X\$ are zero, by Lemma [3721 The objective 
function was stated in (|15p . Thus, the complete program is: 



max 2~2o<i+j<4 c i,j^i,j 

s.t. 3Xi,i - X 1>2 - Xi, 3 + 2X 2 ,o - 2*2,1 - 2* 2 ,2 - 3*3,0 - 3*3,1 - 4*4,0 < 0(e) (30) 
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2Jo<i+i<4 Xij — 1 

Xij > < i + i < 4 

Xo,o, X ^i, X 0t 2, X ^, Xifi = 

(inequality (f30|) is obtained from (f28l) by rearranging terms). Specifically, given e > 0, the ratio 
p is upper-bounded by the objective function value of the LP. We now turn to solve this pro- 
gram. We simplify it, first by omitting the zero variables -X^o, Xq^, -X~o j2 , Xq^, X\fl. Denote 
the set of (remaining) relevant indices by I = {(1, 1), (1, 2), (1, 3), (2, 0), (2, 1), (2, 2), (3, 0), (3, 1), (4, 0)}. 
Next, since our goal is to solve the LP for arbitrarily small values of e, we replace O(e) in the 
constraint ([30]) by zero. Using (fT4"J) to obtain the explicit values of Cjj's, the modified LP is: 

max + J^Xi^ + §^1,3 + f -^2,0 + 3^2,1 + |-^2,2 + f ^3,0 + l|^3,l + -^4,0 

s.t. 3X hl - X h2 - X lr3 + 2X 2j0 - 2X 2jl - 2X 2 , 2 - 3X 3 , - 3X 3 ,i - 4X 4 , < (31) 
Xi,j = 1 (32) 

In order to solve this LP, we use the dual program. Let y, z be the dual variables correspond- 
ing to constraints (|3"Tj) . (|32l) . respectively. The dual program is then: 

min z 

s.t. 3y + z > i| 

-V + z > % 

-V + z>\ 

2y + z>l 
-2y + z>l 
-2y + z>l 
-3y + z > f 
Sy + z > § 
-4y + 2: > 1 

y >o 

Let X* be the vector consisting of X\,\ = 3,^1,2 = f, and Xjj = for all € 7 \ 

{(1, 1), (1,2)}. It is clear that X* is a feasible primal solution. The corresponding objective 
function value is c^X^ + c^Xfa = ±§ • \ + g • | = ||. Let = Jt is 

straightforward to verify that it is a feasible dual solution. The corresponding objective 
function value is z* = ||, which is equal to that of the primal. Thus, from the duality 
theorem, we conclude that X* and (y*,z*) are optimal solutions to the primal and dual 
programs, respectively. By the construction of the (primal) LP, we conclude the following 
result: 

Theorem 3.1 For Type A-instances, A2 is a (p + e)- approximation algorithm for (2,4)- 
UUSC, where p < § = 1.458333... 
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3.3 Bounding p in Type B-instances 

In this subsection we assume that the instance is of Type B, that is, OPT consists of 3— and 
4— columns, and APX consists of 2— and 4— rows. We use the analogous notation to that of 
the previous section. 

Definition 3.4 For given OPT and APX, let Of be the set of '4— columns in which i elements 

lo 4 l 

are covered (by 4— rows), i = 0, 4, and let Xf = \ qpt\ be the proportion of these columns in 
OPT. Similarly, let Of be the set of 3— columns in which i elements are covered (by 4— rows), 
i = 0,...,3, and let Xf = -ML. For any graph F , let 0-' F = Of n V(F), 8 = 3,4, i = 0,...,s. 

The objective function of set cover in terms of these new variables is: 

4 3 

\APX\ =X 2 + X 3 + X 4 = Y, 4\Of\ + 4\Of\ (33) 

4=0 1=0 



where 



and 



A i 4 — i i 
«^4 + — = 2-4, » = 0,-..,4 



4 = 7 + = o - 7 1 i = 0,...,3. 



i 3 — i 3 i 



4 2 2 4 
Explicitly, the column costs are: 

(cg,...,c|) = (2,1.75,1-5,1.25,1) , (4,.. .,4) = (1.5,1.25,1,0.75) . (34) 

Observe that c\ = c^o from the previous section (i = 0, ...,4). The objective function of our 
LP, which bounds p from above, is: 

4 3 

m8^J2 c t X t + J2 c i X f ■ ( 35 ) 
i=0 i=0 

Considering the highest c^'s (i.e., the costs of the most expensive columns), the following 
result is analogous to Lemma 13.21 and therefore its proof is omitted: 

Lemma 3.16 For any Type B-instance, Oq,Oq,0\ = 0. Equivalently, X^X^Xf = 0. 

The next highest coefficient is c\ = 1.5, so we derive a bound on X|. The intersection graph 
G is defined exactly the same, and we assume that it is connected and big (i.e., Assumption 
13.21 holds for this instance type as well). Formally, it consists of 3— and 4— columns in the 
OPT partite, and 4— rows in the APX one. As for H and H: 



The H subgraph 

H is the subgraph of G induced by the 02" commns anci t ne (4—) rows intersecting them. Note 
that these columns are analogous to the O2,o columns of Type A-instance, while there is no 
analog to 0\ t \ columns. Thus, -ff's structure is the same, that is, H obtained from a Type B- 
instance is a special case of H obtained from a Type A-instance, with no On columns. Thus, 
the results from the previous section hold trivially. Specifically, regarding the H subgraph, 
Lemmas 13.41 [331 and [3761 are irrelevant, Lemma 13.71 holds. Lemma 13.81 (i) holds (part (ii) is 
irrelevant), Lemma 13.91 holds, and Lemma 13. 101 holds. The analog of Lemma [3.111 is: 
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Lemma 3.17 For any Type B-instance, for each big CC F: \R{ 2 }\ ~ 0(e)n F < \0 2 ' F \ < 
\R F 2} \ + 0{e)n F . 

The H graph 

H is, again, similar to H from the previous section, but with no columns analogous to 
Oi^\. Specifically, let O be the set of columns which intersect some row in H (i.e., a 4— row 
intersecting some 2 column). For each CC F of H, connect each row in F to distinct vertices 
representing the O-columns intersecting it. Denote these vertices by O f . Let E be the set 
new edges used to connect those vertices. Also, let -Ho denote the subgraph of G induced by 
the remaining vertices (which include all the 3— rows), and add it to H. Finally, let E 2 ' F , 
E F , E 2 denote the set of edges incident to 2 F , O f , 0\ vertices, respectively. The analog 
of Lemma 13.121 is (only parts (i) and (iii) are relevant): 

Lemma 3.18 (i) For each F G C U H , 

\E i /\ = 2\O i /\ , (36) 

(ii) For each F £ C, a row in F which belongs to R?^ contributes i edges to E 2 and 4 — i 
edges to E F , i = 1, 4. 

The analogs of Lemmas 13.131 and 13.141 are, respectively: 
Lemma 3.19 For any Type B-instance, 

\E\ < \Of\ + 2|Of | + 3|Of | + 3|Of| + 4|0|| , (37) 
Lemma 3.20 For any Type B-instance, 

\El\ < \E\ + 0(e)n r . (38) 

(The proof of Lemma 13.201 is identical to that of Lemma 13 . 141 with substituting E 2 F for E F 
and for E? A ). 

Using (I36p and summing over all F G C U Hq, we obtain: 

\El\= E \ E 2 F \= E 2|0|^| = 2|0|| . (39) 
FeCuH FeCuH 

Now, substituting (f39j) in the left-hand side of (f38l) . and ([371) in its right-hand side, and using 
n r = B(n c ) (from Observation 13. ip . we obtain: 

2|0|| < \0\ | + 2\C*l | + 3|0|| + 3[Of | + 4|0|| + 0(e)n c . 

Dividing by n c = \OPT\, we obtain the analog of Lemma 13.151 

Lemma 3.21 For any Type B-instance: 



2X| < Xf + 2Xl + 3Xl + 3Xl + \X\ + 0(e)n c 
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Using (f34"j) , the inequality from Lemma 13.211 and substituting Xq , Xq , Xf = (by Lemma 
I3.16p . we obtain the following LP, which upper-bounds p for Type B-instances: 

max 1.5X1 + 1.25Xf + X\ + 1.25X? + Xf + 0.75Xf 
s.t. 2Xf - 3Xf - 4X1 - *i - 2 ^2 - 3Xf < 

xf + xf + xl + xf + x| + xf = 1 

-^2 ) -^3 ! "^4 i ^1 ) "^2 i -^3 - 

The dual program is: 

min z 

s.t. 2y + z>1.5 
-3y + z > 1.25 
-4y + z > 1 
-y + z > 1.25 
-2y + z > 1 
-3y + 2 > 0.75 

y >o 

It is straightforward to verify that: 

x* = (xf, xf, xf, xf, xf, xf) = (|, 1 0, 0, 0, 0) 

and 

are primal and dual feasible solutions, respectively, achieving the same objective function 
value of g. Thus, they are optimal solution, which implies: 

Theorem 3.2 For Type B-instances, A2 is a (p + e)- approximation algorithm for (2,4)- 
UUSC, where p<\ = 1.4. 

Combining Theorems 13.11 and 13.21 and using Assumption 13. 1\ altogether we obtain: 

Theorem 3.3 A2 is a (p + e)- approximation algorithm for (2,A)-UUSC, where p < || = 
1.458333... . 

In the following, we provide an example for which p = l| = 1.3888... . The instance is 
of Type A. Let \OPT\ = 36m for any fixed m, that is, OPT consists of 36m 4— columns, 
denoted Oi,...,036 m , covering n = 144m base elements. The construction of a local op- 
timum APX is as follows. The 4— rows in APX consist of two sets: In the first one, 
for each i = l,...,12m — 1, there is a 4— row which intersects (i.e., covers a single ele- 
ment of) the four columns O^i+i, and there is one additional 4— row intersecting 
Oi, 036m-2, 036 m _i, Os6 m . Thus, the first set contains 12m rows. In the second set, for each 
i = l,...,3m, there are two 4-rows: one intersecting O^-i, 9m+3i -i, Oi8m+3i-l, 027m+3i-l, 
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and another one intersecting O^i , Og m+ 3j , Ois m +3i , 2 j m+ 2,i ■ Thus, the second set contains 
6m rows, so the total number of 4— rows in APX is X^ = 18m. 

As for the 3— rows in APX, for each i = 1, ...,4m, there is one 3— row intersecting 
03j_i,Oi2 m +3i-i,0 2 4m+3i-i, and another one intersecting 3i , Oi 2m+3i , 2 4 m+3i . Thus, the 
total number of 3— rows is A3 = 8m. 

For a given e > 0, taking m large enough ensures that APX is a local optimum. Using the 
pricing scheme, it is easily verified that the 12m columns: 03i_2 ; i = 1, 12m, are in 02,0; 
and the remaining 24m columns are in 02,1- Hence X 2 ^q = h, A" 2j i = I. The corresponding 

3 _ _ 4 
2' 02 3' 



costs are, by (fT4"|) . c 2) o = f , c 2 ,i = f • The obtained approximation ratio is therefore: 



3 1 4 2 25 
c 2)0 X 2)0 + c 2)1 X 2>1 = -.- + -._ = _. 



4 Concluding remarks 

In this paper we focused on a special case of the unweighted fc-set cover problem. We proposed 
a new paradigm to approach instances of this problem, and we showed that it gives better 
results than the previous known algorithms for unweighted fc-set cover. Our proof is for a 
restricted case in which the instance contains all the pairs of elements. The technical reason to 
consider this special case is that all previous known improvements over the greedy algorithm 
have a special treatment of singletons, which makes the algorithms and their analysis much 
more complicated. By neglecting this technical problem, we can concentrate on the way to 
handle the selection of large sets. 

In this paper we showed that the non-oblivious local search methodology can outperform 
the other methods to approximate unweighted k-set cover, and we conjecture that this is the 
case for the generalized case and not only for (2, 4)-uniform instances. We leave as major 
open problems the tuning of the parameters for the non-oblivious local search algorithm (i.e., 
the weights used in the objective function of the local search), as well as the analysis of the 
resulting algorithm for unweighted fe-set cover. 
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We prove that the solution for (P) stated in the theorem is optimal, and then compute its 
objective function value. In order to show optimality, we construct the dual program of (P), 
denoted (-D), provide a feasible solution to it, and then use a complementary slackness argu- 
ment. By the complementary slackness, we conclude that both solutions are optimal. Then, 
we compute the objective function value of the primal solution. We start by constructing 
(D). The dual decision variables are: 

• Pp+i, —,Pk ' correspond to the set of constraints flSJ), 

• (3 P - corresponds to constraint ([9]), 

• 7p+i, ...,7/c - correspond to the set of constraints (fTOl) . 
The dual program is: 

Program (D) 



A Proof of Theorem 12.11 



min J2i= p Pi 

s.t. h-jik>i 

(3 k -(k- 2) 7fc > 1 



i = i 



...,k- 1 



(40) 
(41) 



Pi - in + hi+i > j{t+T) 

Pi-ii- 2)7; + > -r^j 

P P + P7p+i > p^i 
A,7i>0 



% = p + 1 



i=p+l,...,k-l, j = l,...,i 



...,k- 1 



- 1 



(42) 
(43) 
(44) 



i=p,...,k, j=p+l,... 



k . 



For this LP, the primal variables a^i, a>k,k-i correspond to the set of constraints (f4"0|) . 
afc ; fc corresponds to (|4T|) . a^j, i = p + l,...,k — 1, j = — 1 correspond to ([4^]) . a^j, 

i = p + 1, ...,k — 1 correspond to ([15]). and a p , p corresponds to ([S]) . 
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The dual solution is the following (it is the same for the two cases distinguished in (P), 
depending on the parity of k — p): 

— i 
7* ~ fc(fc-i) 

7fc-l = 

7i = 7i+2 + j(j + i)( i+2 ) > for all i = p + 1, A; - 2 

ft = l + (fc-2)7fc (45) 

A = - »7i+i + (i - 2 )7i j for alH =p + 2, 1 

Pp+l = (p+lf(p+2) +P7 P +i -P7 P +2 

& = pTT " P7p+i • 

We proceed to verify that the primal and dual solutions which we constructed are indeed 
feasible. In addition, we identify the set of tight constraints. 

Lemma A.l (Primal Feasibility) 

The primal solution stated in Theorem \2.1\ is feasible for (P). Moreover, the set of constraints 
(Ep ; (0i and [TU\) are tight, except for [T0\) for the value of i = k — 1 when k — p is odd. 

Proof: First, it is trivial that the non- negativity constraints are satisfied since our solution 
is 0/1. The set of constraints ([8]) are satisfied, and tight, since for each % = p + 1, ...,k, there 
exists exactly one j index such that Oj j = 1 and for all other j values a% = 0. Similarly, the 
constraint Q is tight, as a PtP = 1. As for the set of constraints (flT)}) . which for convenience 
we rewrite as: 

p+i p+i 

^3 a p+i,j -^2j a p+i+i,j -(p + l- l)a P +m, P +m < , l = 0,...,k-p-l , 

3=1 3=1 
we distinguish: 

• Case 1: k — p even: 

— For even values of I, < I < k — p — 2, the first sum is p + 1 since a p+ i tP+ i = 1 (and 
all other terms are zero), the second sum is also p + I because a p+ i + i :P+ i = 1, and 
the last term is zero. Thus, the left-hand side is zero and the constraint is tight. 

— For odd values of I, 1 < I < k — p — 1, the first sum is p + 1 — 1 since o p+ ; jP+ /_i = 1, 
the second sum is zero, and the last term is p + / — 1 since a p +;+i, p +/+i = 1. The 
constraint is tight. 

• Case 2: k — p odd: 

— For even values of/,0<Z<fc — p — 3, the first sum is p + / since a p+ i :P+ i = 1, 
the second sum is p + I because a p+ i + \ ;p+ i = 1, and the last term is zero. The 
constraint is tight. 

— For odd values of/, 1 < I < k — p — 4 the first sum is p + 1 — 1 since Op+jj )P +j_i = 1, 
the second sum is zero, and the last term is p + I — 1 since a p+ i + i^ p+ i + \ = 1. The 
constraint is tight. 

— For I = k — p — 1, the first sum is A; — 2 since ak-i^-2 = 1; the second sum is zero, 
and the last term is A; — 2 since ak,k = 1- The constraint is tight. 
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— For I = k — p — 2, the first sum is A; — 3 since ak-2,k-3 = 1> the second sum is k — 2 
because a^i^-2 = 1 and the last term is zero. Thus, the left-hand side is —1, so 
the constraint is satisfied but not tight. Observe that this case corresponds to the 
value i = k — 1 in the original formulation (jlOp . 



We next consider the feasibility of the dual solution given by (|45p . First of all, it is trivial 
that 7j > 0, i = p + 1, k. Next, by straightforward substitution, it is easily verified that 
the dual constraints (|4ip and (|44p are tight. For the other constraints, we use the following 
auxiliary calculations: 

Lemma A. 2 For i = p + 1, k — 1 : 

li + H+i = ■(■ ) Ts , (46) 
i(i + 1) 

7m - 7* < ttt-^-t • (47) 
l{l + 1) 

Proof: The first part is proved by induction: The case i = k — 1 is immediate since 7fc_i = 
and 7^ = £prrjy- Assume that (JM]) holds for i, p + 2 < i < k — 1. Then for i — 1, we have: 

7i_i + 7j - = 7i+i + + 7» - 

= 7i + 7i+l - i^Jiy 

= o, 

where the last equality holds by the induction hypothesis. For the second part, observe that 

1 1 

7i+i + 7i > .,. ; T\ ~ 7i+l - 7i = > 



where the inequality holds since ji > and the equality is by (|46p . The result follows. ■ 
Lemma A. 3 (Dual Feasibility) 

The dual solution defined by |^ 5| j is feasible for (D). Moreover, the tight constraints are 
|7iP,(f7^; &44\)> an d (4W f or i an d 3 values such that j = i — 1. 

Proof: We first identify the tight constraints in {D). Consider the set of constraints (|43p . 
For i values i = p + 2, k — 1, it is easily seen that they are tight, by the definition of 
For i = p + 1, substituting the definition of we obtain 

p 1 

1 7 P +i + 7 P +2 > 



(p+l)(p + 2) ' ' '^"p + 2 • 

By dM|), the inequality is tight. 

Consider the set of constraints (|42|) . for j values j = i — 1. From /3 p +i's definition, it 
follows immediately that the case i = p + 1 (hence j = p) is tight. For i > p + 2, substituting 
/3j's definition yields: 

1 » - 1 

Again, (|4^|) yields that it is tight. 
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We are done identifying the tight dual constraints. We now turn to verify feasibility for 
the rest of the constraints. Consider the set of constraints (|40p . Substituting the definitions 
of 0k and 7^, we obtain: 

k-j-2 j 

1+ HJ—^-k ' i = i.~>*-i- 

The inequality clearly holds for j < k — 2. For j = k — 1 it evaluates to 1 — k ^_^ > ^r 1 , 
which holds, as k > 2. 

Consider the set of constraints (|42p . for i=p + l,...,fc — 1 and j = 1, i — 2 (we have shown 
that cases for the values j = i — 1 are tight). For i = p + 2, k — 1, we evaluate /3j's definition 
to obtain: 

A = 7x1 - *7i+l + (» - 2)7» 

= 5+1- ICiTT) - (* - + (* - 

= i^J - («' - !hi+l + (» - 1)7, > , 

where the third equality follows from (|46|) and the inequality follows from (|47|) . This proves 
that for i = p + 2, fc — 1, the constraints (|4"2"j) hold, and also that $ > 0. For z = p + 1 (and 
j = 1, ...,p — 1), we evaluate /? p +i's definition: 

/W = (p+1 f (p + 2) + P7p+1 " P >+ 2 ^ ° ' 

where again, the inequality follows from (|47p . This establishes that (|42p holds for p + 1 
(j = 1, ...,p — 1) and that /3 p +i > 0. 

It remains to show that (5 P and j3j~ are nonnegative. As 7^ > and k > 2, it immediately 
follows from the definition that > 0. For /3 p , we use (jUJ) to obtain: 

A> = pTI ~ PTjH-i — ^7 ~~ p(7p+i + 7 P +2) 
- J E - 2 >f) 

p+1 (p+l)(p+2) (p+l)(p+2) - U • 



We now show that the solutions (a PiP , a P 4-i,i, a p+ i )P+ i, Op+2,1, a p+2 , p +2, «fe,i, a&,fc) 
and (f3 p , /3k, 7 p +i, 7fe) satisfy the complementary slackness conditions with respect to 
programs (P) and (-D). Thus, they are both optimal. 

Lemma A. 4 (Primal and Dual Optimality) 

The primal solution (a PjP , 0^+1,1, a,p+\ iP +i, a p +2,i 5 ap+2,p+2 5 a ifc,i> ■••> a fc,jfc) stated in The- 
orem\2j\ is optimal for (P). TTie ditaZ solution ((3 p , (3k, 7p+i> 7fe) defined by |75| ) is 
optimal for (D). 

Proof: Consider (P). By Lemma I A. 11 all of (P)'s constraints are tight except for (110p for 
the value of i = — 1 (and when k — p is odd). But the dual variable corresponding to this 
constraint, Jk-i, is zero. Hence, all primal complementary slackness conditions are satisfied. 

Consider (D). From Lemma |A.3( it follows that the constraints which are not tight are 
(I40p . and (I42p for the case j = 1, ...,i — 2. The primal variables corresponding to (]40p are 
aki, afc fc-i, and are all zeros. Hence the conditions are satisfied. The variables correspond- 
ing to ([4"2"]1 for j = 1, i — 2 are a^j, i = p + 1, k — 1, j = 1, ...i — 2. All of them are zeros, 
so again, the conditions are satisfied. Therefore, all dual complementary slackness conditions 
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are satisfied. Since the complementary slackness conditions hold, the primal (as well as the 
dual) solution is optimal. ■ 

We now compute the primal objective function, denoted POF. This time we distinguish 
four cases, depending on the parity of both k and p. In each case, we substitute the primal 
solution in the objective function. 



Case 1: p even, k even (the term 1 is for 



P0F = ^^(^j TT) + 



M , 2j 



, ,2j(2j+l) (2j+l)(2j+2) / 
fc " 



1 + E^(^T + 



k'\ 

>2 _i 1 



(2j+lj(j+l) 



1 + i+i 

H k -Hp+1 . 

a 2 



Case 2: p even, k odd (the first two terms are for a^, dk-i,k-2 respectively): 

fc-3 . 

rur - 1 + fc(fc _ 1} + ^2i(2j+l) + (2j+l)(2j+2) 



fc-3 

/ I , 

(2j+l")(j+l) 



1 + fc fc(fc-i) + ^i=f (2A1 + 

1 1 k 1 

1 + k ~ k(k-i) + ^i=f J+T 

Hk-l — Hp + 1 + -r 



2 



Case 3: p odd, k even (the first two terms are for a& ak-1^-2 respectively): 



p OF _ 11 fc-2 1 V l~ 2 f 2j+l , 2j+l 

U - f fc(fc _ 1} f 2-, i=£ -i V(2j+l)(2i+2) ^ (2j+2)(2i+3) 

fc r, 

x ^ fc fc(fc-l) ~ l^j = E=l 2j+3 



1 1 — — 1 ^ 

1 + fc ~ fc(fc-l) + 2 Z)| = £±l 2J+T 



- 2(H k - H p+1 ) ~Hk+ He+i + 1 + \ - . 
where the last equality follows from the straightforward identity: 

£ r = H 2r+2 - H 2 i - \ (H r+1 -Hi), for all / < r . (48) 
Case 4: p odd, odd (the term 1 is for a^): 

fc — 3 

POF = 1 + £, Vl ( ( 2i+ 2 /)|2i+2) + — 



j= £zi V(2j+l)(2j+2) "T (2j+2)(2j+3) 
fc-3 

1 _i_ V 2 2 

1 ^ Z -j = £^i 2j+3 
fc-l 

1 + 2 ^j=£±i 2J+T 

2(i7fc_(_i — Hp+i) — Hk+i + H P +i + 1 . 



where again, we used (|48j) . 
This completes the proof of Theorem 12.11 



